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\ Abstract 

In this paper, we study the linear structure of sets A C F£ with doubling constant cr(A) < 2, 
where a(A) :— ■ I n particular, we show that A is contained in a small affine subspace. 

We also show that A can be covered by at most four shifts of some subspace V with \V\ < \A\. 
Finally, we classify all binary sets with small doubling constant. 



1 Introduction and Statement of Results 



o 

u 

Let A be a finite subset of an abelian group G, and define the doubling constant o~{A) of A to be 



\A\ 



where A + A is the collection of all sums a + a! with a, a' E A. 
CN \ Suppose that A is nearly closed under group addition in the sense that o~(A) < K for some 

small K, what can be said about the structure of A? It is easy to see that such sets will possess 
good linear structures. In particular, they will be cosets of subspaces or large subsets of them. In 
\ the 1970s, Freiman obtained the following celebrated theorem for G = Z [3J: 

o 

Theorem 1.1 (Freiman). If A C Z and if \A + A\ < K\A\, then A is contained in a proper 
arithmetic progression of dimension d and size at most s\A\ such that d and s only depend on K . 

In general, B. Green and I. Z. Ruzsa proved similar results for arbitrary abelian groups [6]. 
The precise statement in [6] is the following: 

Theorem 1.2 (B. Green and I. Z. Ruzsa). Let A C G satisfy \A + A\ < K\A\. Then A is 
contained in a coset progression of dimension d{K) and size at most f(K)\A\. One may take 
d{K) = CKHog 2 {K + 2) and f{K) = exp (CK 4 log 2 (K + 2)) for some absolute constant C. 

As we see in the statement of this theorem, the analogue of arithmetic progression in an 
arbitrary abelian group is coset progression. By a coset progression of dimension d we mean a 
subset of G of the form P + H, where H is a subgroup of G, and P is a proper progression of 
dimension d. 

The theorem above gives the best bound known for an arbitrary group. However, better es- 
timates can be derived for specific abelian groups. We consider the case where G = which is 
particularly interesting because of its usefulness for theoretical computer science. See [H HI [11] for 
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more information. 



To study the structure of small doubling subset of B. Green and T. Tao introduced the 
following definition of F(K) and G{K) [5]. 

Definition 1.1. Define F(K) to be the least positive constant such that for any m € Z + and any 
non-empty set A C F™ with o~(A) < K, A is contained in an affine subspace V Q of cardinality 
\V\<F(K)\A\. 

Definition 1.2. Define G{K) to be the least positive integer such that for any m € Z + and any 
non-empty set A C F™ with a (A) < K, there exists a linear subspace V C F™ of cardinality 
\V\ < \A\ such that A is covered by at most G{K) translates of V . 

In the past five years, the bounds for the value of F{K) and G{K) have been continually 
improved. Some of the best results so far are listed below. 

Theorem 1.3 (B. Green and T. Tao [5]). 

F(K) = 2 2K+0( ^ l °3 K ). 

Theorem 1.4 (A quick result follows from [7, Corollary 1.5] and Ruzsa's covering lemma [10, 
Lemma 2.14]). 

G{K)<^K°^ K \ 

Theorem 1.5 (Deshouillers, Hennecart and Plagne [2]). For 1 < K < 4, 

F <*> s u^k-i- 

In [5], the authors give the following formulae for F(K) and G(K) in the region 1 < K < |: 
Proposition 1.6 (B. Green and T. Tao [5]). 



F(K) = 

Proposition 1.7 (B. Green and T. Tao [5 

G(K) = 



K 1<K<{; 
%K 7 - < K < 5 

7 v 4 — xv ^ 5' 



2 K^<I; 

3 1<K<\. 



In this paper, we will extend these results by determining the exact value of F(K) and G(K) 
in the region 1 < K < 2, and thus classify all small doubling sets. In particular, we will prove the 
following: 



Theorem 1.8. 

F(K) 



K 1<K<1; 
jK \ < K < 2. 
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Theorem 1.9. 

f 2 l<K< 7 r, 
G{K) = \ 3 1<K<%; 

( 4 § < K < 2. 

To this end, we need to study the structure of normal sets, i.e., those sets T C F™ with 
|T + T\ = 2\T\ — 1. The definition will be carefully introduced in section 2. Two other key 
definitions, normal numbers and rank of a normal set, will also be given in the same section. 

In section 3, we give a formula of F(K) in terms of normal numbers for | < K < 2. By a 
similar method, we derive an estimate for G(K) in section 4. 

In section 5, we determine all normal numbers and their corresponding ranks via Kemperman- 
type theory, thus determining the exact value of F(K). Being more careful, I prove some better 
bounds for G{K) in section 6, which will determine the exact value of G(K) for 1 < K < 2. This 
will complete the proof of Theorem 1.8 and Theorem 1.9. 

In the last section, we will discuss how the small doubling subsets of F?? are intrinsically related 
to the normal sets, and then completely classify the small doubling sets with cr(A) < 2. 



2 Definitions 

Definition 2.1. (Normal numbers) A positive integer n > 4 is called a normal number if there 
exists anmeZ+ and a set T C Ff such that \T\ = n and \T + T\ = 2n - 1. Such a set T is called 
a normal set of level n. 

There are infinitely many normal numbers. For instance, consider T = {{a\,a 2 , ...,a m ) € 
F™ +1 |ai = 0, a 2 ,a 3 ,...,a m not all l'a} U {(1,0,0, ...,0)} then \T\ = 2 n and \T + T\ = |F™ +1 - 
{(1, 1, 1, 1)}| = 2 n+1 — 1. This means 2 n (n > 2) is a normal number. On the other hand, not 
all numbers are normal. For example, 5 is not normal. 

Let S denote the set of all normal numbers, and S n denote the set of all normal sets of level 
n. Write S = {n\ < n 2 < n% < ...}. n-i is called the i-th normal number. In particular, n\ = 4. 

Let us next recall the definition of Freiman s -isomorphism. 

Definition 2.2. (Freiman isomorphism) Let s > 2 be an integer. Let G, G' be two abelian groups 
and let A C G and A' C G' be subsets. A map <p : A — > A 1 is called a Freiman s-homomorphism if 
whenever a±, a s , b±, b s € A satisfy 

ai+ a 2 + ... + a s = b\ + b 2 + ... + b s 

we have 

0(ai) + <t>{a 2 ) + ... + cf){a s ) = 0(&i) + <K& 2 ) + - + <KM- 

If is also a Freiman s-homomorphism, then we say that <fi is a Freiman s -isomorphism, and 
write A = s A'. 

Now suppose T is a normal set of level n and T', a subset of F™, is Freiman 2-isomorphic to 
T. Then |T'| = \T\ = n, \T' + T'\ = \T + T\ = 2n — 1. Hence, X" is also a normal set of level n. 
This motivates the following definition. 
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Definition 2.3. (rank of normal set) Let T be a normal set of level n. The rank of T is the least 
integer m such that there exists a set T' C F™ which is Freiman 2-isomorphic to T; i.e. T" =2 T. 
Denote the rank of T by i?(n, T). 

Lemma 2.1. For any n € 5 and T £ S n , we aaue R(n,T) < n. 

Proof. Let t>i,t>2, ■ ■ ■ ,v m {m < n) be a maximal collection of linear independent elements in T. 
Then the map <j> : Vi 1— ► ej £ F™ induces an Freiman 2-isomorphism from T to 0(T) C F™. □ 

Definition 2.4. For n € 5, define C(n) = sup<p g g n R(n,T). 

The existence of C(n) is guaranteed by Lemma 2.1. 

3 A Formula for 

Proposition 3.1. Let n« denote the i-th normal number. Then 

2 c( nj ) 

F(K) = max K 

l<j<i 2nj — 1 

■f 2rii~l < K 2n l+1 -l 
y n, ^ ^ ^ ni+1 ■ 

In this section, I will prove Proposition 3.1 by showing F(K) > maxi<j<j ^JtzjK and F(K) < 
maxi<j<j 2n _ 1 K, respectively. 

3.1 The Lower Bound 

It suffices to prove that 



F(K) > K. 

K ' ~ 2rij - 1 

when K > — — - 

- n. 

By definition, there exists a normal set T C F^ nj ' ) of level rij such that C(n,) = R(nj,T). It 
follows that T is not contained in any non-trivial affine subspace of ¥2 \ Let A be a random 
subset0 of T x Fj c ( n ^ w ith n large of density close to -^rjr-- Then the smallest affine subspace 
of Fg containing A is itself. 
Therefore, 

nn oC( n j) 

F(K) > - = = K 



x By a random subset A we mean a set without too much linear structure such that A + A covers the whole set 
(T + T) xF2 _C(nj) . 
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3.2 The Upper Bound 

Here I follow B. Green and T. Tao's idea in [5]. 

Suppose K < 2n ^ +1 1 1 • Let H be the largest subspace of F 2 such that A +A is a union of cosets of H . 
From Kneser's theorem (see [10, Theorem 5.5]) we have \A + A\ > 2\A\ — \H\. Since K < 2n ^" 1 , 
the inequality gives \A + A\ < (2n i+1 - 1)\H\. If we write B := (A + H)/H C F^/iT, then B + B 
has cardinality at most 2nj+i — 2. Moreover, B + B cannot be expressed as the union of cosets 
of any non-trivial subspace of WQ/H. Kneser's theorem gives \B + B\ > 2\B\ — 1 which implies 
< m+i - 1. 

On the other hand, it is clear that 1 rgi 1 < K < 2. Together with Kneser's theorem, we get 
\B + B\ = 2\B\ — 1. Hence -B is Freiman 2-isomorphic to a normal set. In particular, \B\ = rij for 
some j £ {1, 2, i}. 

C(n ■] 

It follows that t4 can be covered by the a subspace isomorphic to F 2 3 x H with cardinality 
2 C M\H\ < f^K- \A\. Therefore, 

2 c ( n j) 

F(K) < max K. 

i<i<i 2rij - 1 



4 An Estimate for G(K) 

Instead of a precise formula, I give the following estimate for G{K). 
Proposition 4.1. Let denote the i-th normal number. Then 

max(C7(n,) + 2 - riog 2 n/|) < G(K) < max 2 c '^)-Li°g 2 K-i)J 

jj- 2nj-l < ^ < 2n i+1 -l 
J rii — n i+1 



4.1 The Lower Bound 

2n ■ — 1 

Suppose — 1 — < K < 2, we show that 

rij 

G{K) > C(nj) + 2- |~log 2 nj~\. 

Consider a normal set T C F^ 7 of level nj satisfying R(nj,T) = C(nj). Write m = C(nj). Under 
a proper linear transformation, we may assume that {0, e±, e2, e m } C T. Let A be the Cartesian 
product (in F™ x F 2 = F 2 n+n ) of T together with a random subset yl' in F 2 of density close to 
n ] .K ■ (When choosing A' , let {0, e m +i, e m +2, e m + n 

Suppose A is covered by the union of a\ + V, 0,2 + V, ai + V where V C F 2 n+n is a linear subspace 
of cardinality |V| < Then 

dim V < log 2 L|^4|J < n - 1 + [log 2 n^] . 

Assume that a\ = 0. Let M t := (a t + V) n {ei, e2, e m+n }, t = 1, 2, 

Take a representative element e(i) from each set Mj (If Mt = 0, simply set e(t) = 0). Since V is a 
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linear space, the difference of two elements in the same Mj belongs to V. Hence 



Mi U ((J{e - e{t)\e G M t \e(t)}) C V. 



t=2 

However all the elements in this set are linearly independent. So 

l i 



dim V > \Mi\ + J^|{e - e(t)\e G M t }\ > \M X \ + ^(|M t | -l) = m + n- i + l. 

Z > C(nj) + 2 - [log 2 n 3 ] 



t=2 t=2 

Therefore, 



4.2 The Upper Bound 

Suppose K < 2 "' +1 — -. Let -ff and B be the same as in §3.2. 
Then 

m < _J^ W < -J_ w . 

2rij — 1 rij — 1 

Note that j4 is contained in some linear space F isomorphic to x H and dimF = C(rij) + 

dimH. 

Let / = Llog2(^-j — 1)J and let H' be a linear subspace of F containing H with dimH 1 = dimH + /. 
Then \H'\ < \A\ and F is the union of 

2(iim F— i— dim H c^m—l c^Cirij)— Llog 2 (rtj — 1)J 

cosets of H' . Therefore, 

G(K) < max 2 p ( n ^-L 1 °S2K- 1 )J. 
i<j'<j 

5 Structure of Normal Sets 

In [8], Kemperman describes the structure of subsets A, B of an abelian group G satisfying [A+-B| = 
|^4| + \B\ — 1. In particular, if we set A = B and G = FJ, then Kemperman's theorem gives the 
structure of normal sets. 

In the language of [9], V. Lev proved the following special case for G = F£: 

Theorem 5.1 (V. Lev [9]). Let r > 1 be an integer. If a subset iCFJ satisfies \A + A\ < 2\A\, 
then one of the following holds: 

(i) there exists a subgroup H < F£ such that A is contained in an H-coset and \A\ > \H\/2; 

(ii) there exists two subgroups F,H< Fg, satisfying \F\ < 8 and F D H = {0}, and an aperiodic 
antisymmetric subset F\ C F, such that A is obtained from a shift of the set F\ + H by removing 
less than \H\/2 of its elements. In this case A + A is the sum F © H with one H-coset removed, 
so that \A + A\ = (\F\ - 1)\H\. 

Based on V. Lev's result, we can determine all normal numbers and the corresponding ranks. 

Theorem 5.2. Let m denote the i-th normal number. Then 
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. n % = 2 i+1 ; 
• C(ni) =i + 2. 

Proof. Let T C F 2 ' be a normal set of level n. Then T satisfies the condition in Theorem 5.1. 
If (i) holds, then H must be ¥ R{n ' T) itself. However, in this case, \T + T\ = 2\T\ - 1 > 2 R( - n > T \ a 
contradiction. 

If (ii) holds, then \T + T\ = (\F\ - 1)\H\. Note that \H\ is a power of 2 and \T + T\ is an odd 
number. So, \H\ = 1. It follows that T is contained in a shift of F which must be the whole space 
¥ R{n ' T) . So 2n - 1 = \T + T\ = \F\ - 1 = 2 R ^ - 1. Thus, n is a power of 2. 
Recall that 2 n (n > 2) are normal numbers. We obtain n« = 2 %+1 . 
Furthermore, i?(n«, T) = 1 + log 2 n« = i + 2, which is independent of T. 

Therefore, C(n;) = i + 2. □ 

r>C(n ■) 

Proof of Theorem 1.8. By Theorem 3.1, it suffices to compute 2n -\ • ^ ac ^' 

2 c ( n i) 2 i+2 2 3 8 

2 nj - 1 ~ 2 f + 2 - 1 ~ 2 3 - 1 ~ 7 

which implies that = f K for | < i^T < 2. □ 



6 The Exact Value of 

For any j > 1, we have that C{rij) + 2 — |~log 2 rij] = j + 2 + 2 — (j — 1) = 3, and that 

C{rij) - Uog 2 (ni - 1)J = {j + 2) - j = 2. By Proposition 4.1, we have 3 < G{K) < 4 for 
1<K<2. 



6.1 When \ < K < § 

Proposition 6.1. When \ < K < § ; = 3. 

Proof. Let .ff and B be the same as in §3.2. In this case, we have \B\ < 8. 

If \B\ < 4, then A is contained in the union of no more than four cosets of some subspace H, where 
\H\ < Covering these four cosets of H by three cosets of a subspace of one dimension higher 
than H we obtain G(K) < 3. 

If \B\ = 8, under a proper linear transformation, we can assume that {0, ei, e2, e3, 64} C 5. Let u 
denote the single element of F 2 \(-B + B). Note that 0, ej, ej + efc G i? + B (1 < i, j, fc < 4). So u 
contains three or four l's, Without loss of generality, we only consider u = (1110) or (1111). In 
either case, it is straightforward to check that B can be covered by three shifts of a linear space 
with four elements. If follows that A can always be covered by three cosets of a linear subspace 
with cardinality no more than \A\. □ 

6.2 When § < K < 2 

Proposition 6.2. When § < K < 2, G{K) = 4. 
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Proof. It suffices to prove that G(K) > 4. 

Let T C Fj be the set of all 5-tuples with no more than two l's. Then T is a normal set of level 5. 
Let A be the Cartesian product (in ¥ 2 x¥ 2 = F^ 5 ) of a random subset A' in ¥ 2 of density close 
to together with the set T. (When choosing A, let {0, e±, e„} G A') 

Now suppose ^4 is covered by the union of V, v± + V and v 2 + V where V is a linear subspace with 
I V| < |j4|. Then there exists i,j G {1, 2, n + 5} such that G + V and ej £ V2 + V (Otherwise, 
V contains n + 4 linearly independent elements). However, + ej is contained in v\ + t>2 + V which 
is disjoint from A, a contradiction. □ 

This completes the proof of Theorem 1.9. 

7 A Complete Classification of Small Doubling sets 

Following the discussion in §3 and §5, we have 

Corollary 7.1. For any normal set T, there exists an m G Z + and a normal set T' C F™ such 
that 

• V = 2 T; 

• \T\ = 2 m - 1 ; 

• {0,ei,e 2 ,...,e m } C T'. 

We call such sets T' elementary normal sets. 
Combining Corollary 7.1 and the discussion in §3.2, we can describe the structure of small doubling 
sets in the following way: 

Proposition 7.2. Suppose A is a subset of F?> with a(A) < 2. Then there exists an integer 
m < [log 2 ( 2 _^ ( -^- ) )] , an elementary normal set T of level 2 m ~ 1 and another integer k G Z + such 
that A is Freiman 2-isomorphic to a random subset ofTxW^ with density 2 2 ™o(A) ' 

Proof. Since B is a normal set, it is 2-isomorphic to an elementary normal set T of level 2 m . 
H is a subspace of ¥ T 2 l ■ So H is isomorphic to F| for some k. It is easy to compute that the density 
of the random subset in T x is • Since densities cannot exceed 1, we have 2 2m a{A) — ^ 

Thus, m< [\ og2 (^^)\. □ 

Therefore, we intrinsically classify all finite binary set with doubling constant cr(A) < 2. Fur- 
thermore, we conclude that: 

Principle 7.3. When studying the linear structure of small doubling binary sets in the sense of 
o~(A) < 2, it suffices to consider elementary normal sets. 

Remark: The discussion above gives us a general idea for studying the exact value of F(K) 
and G(K) when K > 2: we can try to give definitions of generalized normal sets and study the 
linear structure of them. At least, the following conjecture is reasonable. 

Conjecture 7.4. For K > 1, 

• F(K) is a piecewise linear function; 

• G(K) is a piecewise constant function. 
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